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ABSTRACT 

Pnkp is the end-healing and end-sealing component 
of an RNA repair system present in diverse bacteria 
from many phyla. Pnkp is composed of three 
catalytic modules: an N-terminal polynucleotide 
5' kinase, a central 2',3' phosphatase and a 
C-terminal ligase. The phosphatase module is a 
Mn^^-dependent phosphodiesterase-monoesterase 
that dephosphorylates 2',3'-cyclic phosphate RNA 
ends. Here we report the crystal structure of the 
phosphatase domain of Clostridium thiermocellum 
Pnkp with Mn^^ and citrate in the active site. The 
protein consists of a core binuclear metallo- 
phosphoesterase fold (exemplified by bacterio- 
phage X phosphatase) embellished by distinctive 
secondary structure elements. The active site 
contains a single Mn^^ in an octahedral coordination 
complex with Asp187, His189, Asp233, two citrate 
oxygens and a water. The citrate fills the binding 
site for the scissile phosphate, wherein it is 
coordinated by Arg237, Asn263 and His264. The 
citrate invades the site normally occupied by a 
second metal (engaged by Asp233, Asn263, His323 
and His376), and thereby dislocates His376. A con- 
tinuous tract of positive surface potential flanking 
the active site suggests an RNA binding site. The 
structure illuminates a large body of mutational 
data regarding the metal and substrate specificity 
of Clostridium thermocellum Pnkp phosphatase. 

INTRODUCTION 

RNA repair enzymes capable of sealing 2',3'-cyclic phos- 
phate and 5'-0H ends are present in diverse taxa in all 
phylogenetic domains of life (1). Repair pathways 
anchored by 'classic' ATP-dependent RNA hgases entail 
the joining of 3'-OH and 5'-P04 termini. To satisfy the 



ligase specificity for 3'-OH/5'-P04 ends, the original 
broken 2',3'-cyclic phosphate and 5'-OH ends must be 
'healed' before they can be sealed. Healing entails two 
discrete reactions: hydrolysis of the 2',3'-cychc phosphate 
by a phosphoesterase enzyme to form a 3'-OH, and phos- 
phorylation of the 5'-OH by a polynucleotide kinase 
enzyme to form a 5'-P04. 

The Pnkp-Henl RNA repair system of Clostridium 
thermocellum (Cth) is encoded in an operon-like gene 
cassette (Figure lA). Pnkp-Henl systems are present in 
diverse bacteria from 10 different phyla. C?/iPnkp is the 
end-heahng and end-sealing component and is composed 
of three catalytic domains: N-terminal kinase, central 
phosphatase and C-terminal hgase (2). The kinase 
module catalyzes phosphoryl transfer from ATP to the 
5'-OH RNA end. The phosphatase domain releases Pi 
from 2'-P04, 3'-P04 or 2',3'-cyclic phosphate 
ribonucleotides (Figure IB). The Hgase domain reacts 
with ATP to form a covalent enzyme-AMP adduct, just 
as classic RNA hgases do during strand joining, but it is 
unable per se to seal RNA strands (2,3). The sealing 
function of the Pnkp ligase domain is activated by the 
N-terminal half of the Henl protein (4). The C-terminal 
half of Henl is an autonomous Mn^"^-dependent 
3'-terminal ribose 2'- 0-methyl transferase that installs a 
2'-OCH3 mark at the RNA repair junction before 
ligation, and thereby immunizes the sealed RNA against 
recurrent endonuclease cleavage (Figure IB) (5-8). 

The physical and functional properties of the Pnkp- 
Henl RNA repair system are distinctive vis d vis others 
that have been characterized (1,9,10). Thus, it is of interest 
to dehneate the structure of the Pnkp-Henl holoenzyme 
and, en route, its constituent catalytic domains, to under- 
stand the principles of catalysis and substrate recognition, 
the relatedness of the Pnkp and Henl domains to other 
nucleic acid repair enzymes and the logic of the domain 
organization and its implications for orchestrating the 
heahng and sealing steps. 

To date, crystal structures have been solved for the 
isolated C/ZiHenl C-terminal methyltransferase domain 
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Figure 1. The Pnkp-Henl RNA repair pathway. (A) The adjacent co-oriented ORFs encoding the Henl and Pnkp polypeptides comprise a bacterial 
RNA repair cassette conserved in taxa from 10 different bacterial phyla. Pnkp is a trifunctional RNA repair enzyme composed of 5'-OH poly- 
nucleotide kinase, 2', 3' phosphatase and ligase domains. Henl is composed of a C-terminal methyltransferase catalytic domain (MTase) fused to a 
unique N-terminal module that binds to the Pnkp ligase domain. (B) 2',3'-cyclic phosphate and 5'-OH ends are substrates for healing and sealing by 
Pnkp and Henl. The 5' end is phosphorylated by the Pnkp kinase, and the 2',3'-cyclic phosphate is removed by the Pnkp phosphatase. The Henl 
methyltransferase installs a 2'-OCH3 mark at the terminal ribonucleoside before ligation of the ends. The ligation reaction is mediated by a 
heterodimer of the Pnkp ligase domain and the Henl N-terminal domain. The repair junction with the methyl mark is then resistant to scission 
by transesterifying endoribonucleases. 



(a monomer) bound to the reaction product 5-adeno- 
sylhomocysteine (6), the isolated C/ZzPnkp C-terminal 
ligase domain (a monomer) in a complex with the ATP 
substrate and as the covalent hgase-AMP intermediate 

(3) , and the Hgase-AMP»Mg^^-Henl(N) heterodimer 

(4) . The structure of the hgase domain is distinctive, 
being composed of a classic hgase adenylyltransferase 
fold that is embellished by a unique a-hehcal insert 
module and a unique C-terminal a-helical module (3). 
The insert module of the ligase domain mediates the inter- 
action with the N-terminal domain of Henl, which is itself 
a novel tertiary structure (4). 

The N-terminal 170-amino acid segment of C/APnkp 
is an autonomous polynucleotide kinase module. We 
recently reported the crystal structure of the kinase 
domain (a homodimer) bound to ATP»Mg^"^ and 
ADP«Mg^"^, reflective of substrate and product 
complexes, respectively, and used these structures to 
guide a mutational analysis of the kinase active site (11). 
Our findings highhghted structural and functional 
similarities to another RNA repair enzyme, phage T4 
polynucleotide kinase (12,13), while revealing unique 
structural features of the bacterial polynucleotide kinase 
clade. 

The 2', 3' phosphatase component of C/APnkp has been 
the subject of extensive biochemical characterization and 
mutational analyses (2,14-17). The enzyme is a Ni^^/ 
Mn^^-dependent phosphodiesterase and monoesterase, 
active on nucleotides (2',3'-cyclic AMP, 3' AMP and 2' 
AMP) and generic substrates (bis-;7-nitrophenyl 



phosphate, /i-nitrophenyl phosphate and />nitrophenyl 
phenylphosphonate). The phosphodiesterase and mono- 
esterase reactions rely on overlapping but different ensem- 
bles of active site functional groups. The enzyme is 
remarkably plastic, insofar as the C/ZzPnkp phosphatase 
can be transformed toward narrower metal and substrate 
specificities via mutations of the active site. For example, 
certain changes transform C/APnkp into a Mn^^-depend- 
ent phosphodiesterase devoid of monoesterase activity 
(15-17). The RNA 2',3'-cyclic phosphodiesterase activity 
of C//iPnkp is thought to be most pertinent to RNA repair 
in hght of the fact that 2',3'-cyclic phosphate termini are 
the predominant products of several known RNA damage 
pathways (17). 

The primary structure of the C?/iPnkp phosphatase do- 
main identifies it as a binuclear metaUophosphoesterase — 
a vast enzyme superfamily distributed widely among taxa. 
A prototypal member is bacteriophage A phosphatase 
(X-Pase), which has been characterized structurally and 
biochemically (16,18-21). A-Pase uses Mn^^ to catalyze 
phosphoester hydrolysis with a variety of substrates, 
including phosphopeptides, phosphoproteins, nucleoside 
2',3'-cyclic phosphates and generic organic phosphomo- 
noesters and diesters such as /j-nitrophenyl phosphate 
and bis-/i-nitrophenyl phosphate. Yet, the physiological 
substrate(s) and biological function of X-Pase remain 
obscure. To our knowledge, only one other metaUopho- 
sphoesterase enzyme besides C/ZzPnkp plays a dedicated 
role in RNA metabohsm, that being the RNA debran- 
ching enzyme Dbrl (22). Here we report the crystal 
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structure of the Cf/iPnkp phosphatase domain, which 
verifies its membership in the metallophosphoesterase 
superfamily, while highlighting structural features that 
distinguish it from 1-Pase. 

METHODS 

Purification of aAPnkp-(171-424) 

A pET-based expression plasmid encoding the phosphat- 
ase domain fused to an N-terniinal HisioSmtS module and 
a C-terminal Hisg-tag was transformed into E. coli 
BL21(DE3). A 4-1 culture derived from a single kanamy- 
cin-resistant transformant was grown at 37°C in Luria- 
Bertani medium containing 60 |ig/ml kanamycin until the 
^600 reached 0.6. The culture was then adjusted to 0.3 mM 
isopropyl-P-D-thiogalactoside and 2% (v/v) ethanol, and 
incubated for 17h at 17°C with continuous shaking. Cells 
were harvested by centrifugation, and the pellet was stored 
at — 80°C. All subsequent procedures were performed at 
4°C. Thawed bacteria were resuspended in 130 ml of lysis 
buffer (50mM Tris-HCl, pH 7.5, 1.2 M NaCl, 25 mM 
imidazole, 10% glycerol). Lysozyme, PMSF, benzamidine 
and Triton X-100 were added to final concentrations of 
1 mg/ml, ImM, 1 niM and 0.2%, respectively. After 
mixing for ~50 min, the resulting lysate was sonicated to 
reduce viscosity, and insoluble material was removed by 
centrifugation for 45 min at 1 7 000 rpm in a Sorvall SS34 
rotor. The soluble extract (130 ml; 2.7 g of protein) was 
applied to a 13-ml column of Ni-nitrilotriacetic acid- 
agarose (Qiagen) that had been equilibrated with lysis 
buffer. The column was washed sequentially with 30 ml 
of 3M KCl in buffer A (50 mM Tris-HCl, pH 7.5, 
200 mM NaCl, 10% glycerol) and 30 ml of lOOmM imid- 
azole in buffer A. The bound protein was serially step- 
eluted with buffer A containing 200 mM and 300 mM 
imidazole. The elution profile was monitored by SDS- 
PAGE. The peak phosphatase-containing fractions were 
pooled (185ml; 550nig of protein). A 60-ml aliquot con- 
taining IBOmg of protein was supplemented with 0.36 mg 
of His-tagged Smt3-specific protease Ulpl and 1.5 ml of 
200 mM EDTA. The mixture was then dialyzed overnight 
against 41 of 20 mM Tris-HCl, pH 7.5, 100 mM NaCl, 
ImM EDTA, and 10% glycerol. SDS-PAGE analysis of 
the dialysates indicated that the HisioSmt3 module had 
been cleaved from the phosphatase domain. The dialysate 
was adjusted to 50 mM imidazole and then applied to a 
7-ml phosphocellulose column that had been equihbrated 
with 20 mM Tris-HCl, pH 7.5, 70 mM NaCl, 50 mM imid- 
azole, and 10% glycerol. The phosphatase was recovered 
in the flow-through fractions, free of the HisioSmt3 
protein, which was retained on the column. The phosphat- 
ase was concentrated by centrifugal ultrafiltration to 
8.3 ml of an 11.5 mg/ml solution. Two separate ahquots 
(4 ml each) were gel-filtered through a Superdex-200 
column equihbrated with 20 mM Tris-HCl, pH 7.5, and 
70 mM NaCl, lOyo glycerol; the A2^o profile revealed a 
single peak of included protein at a position consistent 
with a monomer. The peak fractions were pooled and 
concentrated by centrifugal ultrafiltration to 3.4ml of a 
24 mg/ml solution and stored at — 80°C. 



Crystallization 

The phosphatase solution was adjusted to lOniM MnCl2 
and incubated for 10 min before ahquots (2 ^l) were mixed 
on a cover slip with an equal volume of precipitant 
solution containing lOOmM Na-citrate, pH 5.6, 33.5% 
(w/v) PEG-3350, and 15% DMSO. Crystals were grown 
at 22°C by hanging-drop vapor diffusion against a reser- 
voir of the same precipitant solution. Rod-shaped crystals 
appeared after 1 d; single crystals were frozen directly in 
hquid nitrogen. 

Diffraction data collection and structure determination 

The structure of C?/iPnkp-( 17 1-424) was solved using 
single-wavelength anomalous diffraction (SAD) data 
from a single crystal as follows. Diffraction data were col- 
lected at NSLS beamhne X25 using a Pilatus-6M detector 
at a wavelength of 1.846 A intended to maximize anomal- 
ous scattering from the manganese atoms that we 
presumed were bound to protein. Data at 2.9-A resolution 
were collected in three continuous sweeps (1° increments) 
at three different crystal orientations to give a total of 933 
images that were processed in HKL2000/SCALEPACK to 
give the highly redundant phasing dataset described in 
Table 1 (top^ right column). A second higher- resolution 
dataset (1.6 A) used for refinement was collected on a 
second crystal at a wavelength of 1.1 A. Data from 
two continuous sweeps comprising 185 1° increments 
and 200 0.5° increments were processed using XDS (23) 
to give the refinement dataset described in Table 1 (top left 
column). Both crystals belonged to space group P2i with a 
unit cell volume consistent with two C//zPnkp-(171^24) 
protomers per asymmetric unit, assuming a solvent 
content of 40%. The crystals were isomorphous. The 
phasing dataset was truncated at a diffraction maximum 
of 2.9 A owing to the crystal-to-detector distance used; the 
refinement dataset was complete to a diffraction limit of 
1.6A. 

Phases for Cf/!Pnkp-(171^24) crystals were obtained 
via PHENIX.AUTOSOL (24). Within AUTOSOL, the 
HYSS subroutine found two manganese sites; their 
coordinates were consistent with the translational non- 
crystallographic symmetry (NCS) observed via native 
Patterson analysis. Using phases from these initial two 
sites, the PHASER subroutine found four additional 
low-occupancy sites. Initial phases using all six sites were 
improved via density modification in the RESOLVE 
subroutine using NCS averaging. The initial model 
built by the AUTOBUILD subroutine contained 326 
residues in two protomers and revealed the low-occupancy 
sites identified by PHASER to correspond to sulfur atoms 
of Cysl91 and Met251 of each protomer. This initial 
model was then refined against the high-resolution 
dataset, extended further in AUTOBUILD and iteratively 
rebuilt by hand in COOT (25) and refined in 
PHENIX.REFINE. The final model had Rfree/Rwork 
values of 20.8 %/l 6.9%, with excellent geometry, no 
Ramachandran outliers and no large Fq-Fc difference 
Fourier peaks (Table 1). 
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Table 1. Crystallographic data and refinement statistics 



Refinement dataset 



Phasing dataset 



Space group 

Unit cell dimensions (A) 



130K 



Diffraction data quality 
Resolution, A 

Radiation source/wavelength 
Processing software 

Rsym 1 °/o 

Unique reflections'' 

Mean redundancy 

Completeness, % 

Mean I/al 
Phasing statistics 

Phasing method 

Resolution, A 

Signal to noise"^ 

Figures of merit'' 
Refinement and model statistics (refinement dataset, F>1.34aF^ 

Resolution (A) 

Completeness, % 

-/^free /-^^work. % 

RMSD: bonds/angles 
Protomers per ASU 
NCS RMSD 
NCS restraints 
Ramachandran plot: 
B-factors, A^ 
Overall/Wilson 
Protein (by chain) 
Mean TLS anisotropy 
PDB ID 
Model contents 
Protein residues 
Heteroatoms 
Water 



P2, 

a = 66.70 
b = 38.66 
c = 92.95 
P = 91.80° 

100.0-1.6 (1.64-1.6) 
NSLS X25/1.1 A 
XDS 
3.9 (50.6) 
62 980 (4694) 
4.68 (2.75) 
98.8 (89.9) 
19.39 (2.04) 



P2, 

a = 67.03 
b = 38.52 
c = 92.07 
P = 91.96° 

50.0-2.9 (3.0-2.9) 
NSLS X25/1.846A 
HKL2000/Scalepack 
8.8 (21.4) 
20 599 (2041) 
7.3 (6.5) 
99.9 (99.6) 
38.42 (16.30) 



SAD (2 Mn^ 

46.0-3.1 

0.12 

0.39/0.74 

37.6-1.6 (1.62-1.60) 
99.0 (89.0) 
20.8/16.9 (24.1/25.9) 
0.013 A/1.68° 
2 

0.53 A over 254 C„ 
None 

94.6% favorable, no outliers 

40.8/27.0 
37.4/44.4 

0.35 (16 TLS groups) 
4J60 

273(A) + 264(B) 

2 Mn^^, 3 citrate, 4 polyethylene glycol, 8 glycerol 
353 



4 sulfur) 



Standard definitions are used for all parameters. Figures in parentheses refer to data in the highest resolution bin. The refinement and geometry 
statistics come from PHENIX. 

"Rsym output as as Rf„cior by XDS and R,,„ by SCALEPACK. 

''F^ and F^ were treated as equivalent in the refinement dataset and independent for the phasing dataset. 

■"'Signal to noise ratios for anomalous phasing represents the fraction of data for which D.^„„ > 3aDii„o as output by XTRIAGE. 
''Figures of merit are from PHASER/RESOLVE for heavy atom phasing. 

"Rita, dataset consisted of 5% of data chosen at random against which the structure was not refined. 



RESULTS AND DISCUSSION 

Crystallization of the phosphatase domain of CthPnkp 

The N-terminal segment of CthPnkp, spanning residues 1- 
425, is active as a polynucleotide kinase and a 2', 3' 
phosphoesterase (2). The central segment C^/!Pnkp-(171- 
424) comprises an autonomous phosphatase domain (14). 
Here we grew crystals of C?/!Pnkp-(171^24) by hanging- 
drop vapor diffusion after mixing a sample of the protein 
solution containing 0.8 mM phosphatase and 10 mM 
MnCl2 with an equal volume of precipitant solution con- 
taining 100 mM Na-citrate, pH 5.6, 33.5% PEG-3350, and 
15% DMSO. The crystals diffracted well (up to 1.6-A 
resolution) and belonged to space group P2i. SAD 
phasing located two manganese atoms and four sulfur 
atoms in the asymmetric unit, which contained two phos- 
phatase protomers. The refined model of the phosphatase 
at 1.6-A resolution (Rwork/Rfree = 16.9%/20.8%; Table 1) 
consisted of a continuous polypeptide chain with excellent 



geometry and no Ramachandran outliers. A single Mn 
and a citrate anion were present in the active site of each 
protomer. The A and B protomers superimposed with a 
root-nieaii-square deviation (RMSD) of 0.53 A at 254 Cot 
positions. The phosphatase protomers are monoineric in 
the crystal. 

Overview of the phosphatase structure 

Two views of the tertiary structure of the phosphatase are 
depicted in Figure 2A; helices are colored cyan, and p 
strands are magenta. The secondary structure elements 
are shown above the amino acid sequence in Figure 2D. 
The phosphatase fold consists of a central (3 sandwich 
composed of a six-strand p sheet with topology pij,- 
p7;-P8t-P9t-Pllt-P10^^ (on the left side of the 
sandwich in Figure lA) and a five-strand P sheet with 
topology P6t-P5t-P2t-P12|-P13t (on the right side of 
the sandwich). The folding topology is diagrammed in 
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Figure 2. Tertiary structure of the C//iPnkp phosphatase domain and comparison with X phosphatase. (A) The tertiary structure of protomer B of 
the Cz/jPnkp phosphatase (C//7Pase) is shown in two orientations. The N and C termini and selected secondary structure elements are labeled in 
the left panel. Citrate (stick model) and Mn^^ (magenta sphere) are bound in the active site. (B) The structure of A phosphatase (protomer C from 
pdb 1G5B) with sulfate and Mn^^ in the active site was superimposed on the C^/iPase structure (the view at right in panel A) and then offset 
horizontally. (C) The surface models of Cr/iPase and vacuum electrostatics were generated in Pymol. The citrate in the active site is shown as a stick 
model. The middle view is looking down on the active site. The left/right views are rotated clockwise/counterclockwise to highlight the stalk-like 
projection of the Cth-loop from the enzyme surface flanking the active site. (D) The aligned primary structures of the Cth and X phosphatases are 
shown, with amino acids colored according to secondary structure (blue for helices and magenta for f) strands). Positions of side-chain identity/ 
similarity are denoted by black circle above the sequence. Gaps in the alignment are indicated by dashes. The secondary structure elements of Cth 
phosphatase are shown above the amino acid sequence, with (3 strands rendered as arrows and a helices as cylinders. (E) Folding topology of the 
Cf/jPnkp phosphatase. 
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Figure 2E. The (3 sheets are flanked by a helices 
(Figure 2A). Citrate and Mn^^ are bound in a pocket at 
the base of a deep canyon formed by the 17-amino acid 
loop connecting a helices 6 and 7 (Figure 2A and C). 

A DALl search (26) with the C//7Pnkp phosphatase 
identified numerous members of the binuclear metallopho- 
sphoesterase superfamily as homologs. Top DALI hits 
were a putative phosphatase from Trypanosoma briicei 
(pdb 2QJC; Z score, 23.8; RMSD of 2.6 A over 207 Ca 
positions), a putative diadenosine tetraphosphate hydro- 
lase from Shigella flexneri (pdb 2DFJ; Z score, 22.1; 
RMSD of 2.9 A over 206 Ca positions) and bacteriophage 
X phosphatase (pdb 1G5B; Z score, 21.9; RMSD of 2.3 A 
over 199 Ca positions). The superimposed tertiary struc- 
tures of Cz/iPnkp and A-Pase are shown side-by-side in 
Figure 2A (right panel) and B. Their primary and second- 
ary structures are ahgned in Figure 2D. The bacterial and 
phage enzymes have a common core fold, comprising the 
central p sandwich and several of the flanking a helices. 

There are three noteworthy differences between the bac- 
terial and phage phosphatases: (i) the presence in C/APnkp 
of a 17-amino acid p-hairpin module, composed of strands 
P3 and P4 and the intervening loop (Figure 2A), that has 
no counterpart in X-Pase; (ii) the 'Cth-loop' and helix a7 
that have no equivalent in A-Pase (Figure 2A and B); and 
(iii) the presence in Pnkp of a C-terminal pi3 strand at the 
outer edge of the flve-strand P sheet that is absent in X- 
Pase (Figure 2D and E). 

The Cth-loop forms a distinctive stalk-hke protrusion 
from the enzyme surface next to the active site 
(Figure 2C). The loop is rich in acidic side chains 
(Glu346, Glu350, Asp352, Glu353, Asp361) that impart 
a negative surface electrostatic potential (colored red in 
Figure 2C) to this part of the enzyme. The Cth-loop is 
present and conserved in length and primary structure in 
10 other Pnkp enzymes from diverse taxa representing 
10 different bacterial phyla (Figure 3B). A stereo view of 
the structure of the a6-loop-a7 segment is shown in 
Figure 3A, with conserved side chains and their inter- 
actions highlighted. The base of the loop is stabilized by 
(i) van der Waals contacts between residues Tyr341 and 
Trp362 and between Glu346 and Tyr360, (ii) a hydrogen 
bond from Tyr360 to Asp365 and (in) main-chain 
hydrogen bonds from Tyr360-N to Glu346-0 and from 
Thr348-N to Val358-0. The top of the loop is stabilized 
by (i) main-chain hydrogen bonds between Glu350-O and 
Val358-N and from Asp352-N to Leu356-0, and (ii) a trio 
of hydrogen bonds between the Asp352 carboxylate and 
the main-chain amide nitrogens of Tyr354, Gly355 and 
Leu365 (Figure 3A). We speculate that the Cth-loop 
may undergo a conformational switch on binding to an 
RNA substrate, whereby its concave surface that forms 
the canyon wall next to the active site closes over the poly- 
nucleotide to clamp it in place. Alternatively, the Cth-loop 
might facihtate relocation of the healed RNA 3' end to 
other enzymatic active sites (e.g. methyltransferase or 
ligase) during RNA repair. 

The distinctive P3-P4 hairpin module is disposed on the 
inferolateral surface of the enzyme remote from the active 
site (Figure 2A). The inward-facing side chains of the p3 
and P4 strands are hydrophobic (Ile206, Tyr218, Val220) 



and make van der Waals contacts with hydrophobic 
residues emanating from helices al and a2 (Tyrl92, 
Leu 199, Leu247, Met251). The outward-facing amino 
acids of the P3-p4 hairpin are mainly hydrophilic 
(Glu207, Glu210, Arg291, Thr221), consistent with them 
being solvent exposed in the context of the Pnkp-Henl 
complex. 

The C-terminal pi 3 strand (highhghted in yeUow in 
Figure 4A) and flanking amino acids are an integral elem- 
ent of the Pnkp phosphatase fold. In particular, residues 
Arg405, Tyr406, Glu408, Glu410 and Tyr420 are 
conserved among bacterial Pnkp proteins (Figure 4B) 
and engage in a wide network of hydrogen bonding and 
electrostatic interactions that tether the pi2-pi3 hairpin 
to other secondary structure elements of the phosphatase 
(Figure 4A). 

The central panel in Figure 3C shows that citrate 
(a mimetic of the scissile phosphate) occupies a deep 
pocket at the canyon's base. The negative electrostatic 
surface of the pocket reflects its function in binding the 
metal cations essential for catalysis. The citrate sits above 
the Mn^^ atom in the active site. The enzyme surface to 
the right of the active site pocket is notable for its positive 
electrostatics (colored blue). The positive potential extends 
around the enzyme surface as the protein is rotated coun- 
terclockwise (the rightmost view in Figure 2C), suggestive 
of an RNA-binding track. We speculate that RNA end 
heahng by the Pnkp phosphatase entails anchoring of 
the RNA 2', 3' phosphate anion in the metal-bound 
pocket while the flanking 3'-terminal RNA segment 
wraps around the blue enzyme surface. 

The phosphatase active site 

The Fq-Fc maps revealed electron density for citrate in the 
active site; the composite omit map for this citrate and 
nearby elements is shown in Figure 5A. The strong anom- 
alous peak over the neighboring metal ion validated its 
assignment as manganese (Figure 5A). A detailed view 
of the active site architecture and the atomic contacts to 
citrate and Mn'^^ is shown in Figure 5B. Two of the sites in 
the Mn^^ coordination complex are occupied by citrate 
oxygens. The other four sites are filled by Hisl89-Ns, 
Aspl87-052, Asp233-052 and a water coordinated by 
Asp392-0§2. The atomic contacts of Mn^+ in the 
citrate-bound C/ZiPnkp phosphatase are virtually identical 
to those of the corresponding Mn^^ in A-Pase (shown in 
Figure 5C). The 1-Pase structure captures the enzyme with 
a sulfate anion in the active site (21) in a state that mimics 
a product complex of the phosphomonoesterase reaction. 
The sulfate Ol atom coordinated between the two Mn'^^ 
ions in the >^-Pase structure corresponds to the metal- 
bound OH nucleophile in the Michaelis complex before 
phosphohydrolase chemistry (exemplified by the structure 
of Mycobacterium tuberculosis Rv0805; pdb 2HY1) (27). 

The citrate C6 atom (colored yellow in Figure 5B) 
superimposes on the sulfate sulfur atom in A-Pase 
(Figure 5C). Atomic contacts to the citrate 03, 05 and 
07 atoms made by the Arg237 Asn263 and His264 side 
chains in C^APnkp hew to those made by X-Pase residues 
Arg53, Asn75 and His76, respectively, with the sulfate 
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Figure 3. Structure and conservation of the Cth-loop. (A) Stereo view of tlie distinctive a6-loop-0(7 segment of Cr/iPnkp phosphatase. Conserved 
amino acids are shown as stick models. (B) The primary structure of the a6-loop-a7-p9 segment of C?/iPnkp phosphatase is aligned to the 
equivalent segments of Pnkp proteins from bacteria representing 10 phyla: De.iidfo.'iporosinus yoiingiae (Dyo, Firmicutes), Myxococcus slipitatus 
(Mst, Proteobacteria), Ktedonohacter racemifer (Kra, Chloroflexi), Treponema denticola (Tde, Spirochaetes), Anahaena variabilis (Ava, 
Cyanobacteria), Bergeyella zoohelcum (Bzo, Bacteriodetes), Schle.sneria paludicola (Spa, Planctomycetes), Micromonospora aurantiaca (Mau, 
Actinobacteria), Deinococcus gobiensis (Dgo, Deinococcus-Thermus) and Candidatus Koribacter versatili.i (Cko, Acidobacteria). Amino acids are 
colored according to the Cth secondary structure (blue for a helices and magenta for p strands). Positions of side chain identity /similarity in all of the 
aligned Pnkps are denoted by black circle above the sequence. The active site His376 residue is denoted by vertical line. 



anion. Thus, we surmise that Arg237, Asn263 and His264 
engage the scissile phosphate during catalysis by the 
CthPnkp phosphatase and, together with the metal(s), sta- 
bilize the transition state and promote expulsion of the 
ribose hydroxyl leaving group. 

The salient feature of the CtliPnkp active site vis d vis X- 
Pase is that filling the phosphate binding site with citrate 
precludes occupancy of the second metal site. Specifically, 
the citrate C2, CI, Ol and 02 atoms encroach on the 
second Mn^^ site (Figure 5A and B). This site in 
Cf/iPnkp, which can be inferred with high confidence 
from the X-Pase structure, would be formed by Asp233, 
His323, His376 and Asn263-0§. When citrate is bound, 
its Ol atom makes a hydrogen bond with His323-N£ 
(Figure 5B). Also, the citrate CI, Ol and 02 carboxylate 
dislocates the "^His-Thr"'' dipeptide iii the p9-pl0 loop, 
displacing the His376-N5 atom 3.2 A away from its 
position in the A-Pase Mn^"^ coordination complex 
(see His376 in Figure 5B versus His 186 in Figure 5C). 



In effect, the citrate»Mn structure of CthPnkp 
exempUfies an inhibitor complex. 

Structural insights to mutational data 

Initial mutational analysis of the C//7Pnkp phosphatase by 
alanine scanning identified Asp 187, His 189, Asp233, 
Arg237, Asn263, His264, His323, His376 and Asp392 
as essential for its Ni^"^-dependent ribonucleotide-3'- 
phosphomonoesterase and ribonucleotide-2'-phospho- 
monoesterase activities (14). Introducing conservative 
substitutions for these residues (e.g. Asn or Glu for Asp, 
Asn or Gin for His, Gin or Asp for Asn) failed to restore 
the ribonucleotide phosphomonoesterase activities, except 
for the replacement of His264 by glutaniine (14). 
Subsequent studies focused on the Mn^"^-dependent 2',3'- 
cyclic phosphodiesterase activity of CthPnkp, which is the 
most pertinent to its 3' end-healing function during RNA 
repair (17). Mutations of Asp 187 and Asp233 abohshed 
2',3'-cyclic phosphodiesterase activity (16), consistent with 
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Figure 4. Structural role of the distinctive pi 3 strand. (A) Stereo view highlighting the hydrogen bonding and electrostatic interactions of the C- 
terminal (313 strand (strand colored yellow) and flanking residues that tether the pi2-pi3 hairpin to other secondary structure elements of the 
phosphatase. (B) The primary structure of the C-terminal piO-pil-pl2-pi3 segment of O/jPnkp phosphatase is aligned to the equivalent segments 
of Pnkp proteins from bacteria representing 10 phyla (see Figure 3 legend). Positions of side-chain identity/similarity in all of the aligned Pnkps are 
denoted by black circle above the sequence. The active site His376 residue is denoted by vertical line. 



Aspl87 and Asp233 interactions witii one or both of the 
catalytic Mn^^ ions in the active site (Figure 5). Mutations 
of His264 reduced cycHc phosphodiesterase activity by an 
order of magnitude (16), highhghting the importance of its 
contacts to the scissile phosphate inferred from the citrate 
complex (Figure 5). Indeed, this phosphate-binding 
histidine was found to be a defining determinant of the 
ability of other binuclear inetaUophosphoesterases to act 
as 2',3'-cyclic nucleotide phosphodiesterases (27). 

Mutations of active site components His 189 and 
Arg237 exert remarkable effects on the cyclic phospho- 
diesterase activity of C/ZzPnkp that are worth discussing 
in light of the crystal structure. Arg237 makes a bidentate 
contact with the citrate that presumably reflects its inter- 
action with the scissile phosphate (even allowing for 
adoption of an alternative arginine rotamer as seen the 
l-Pase«S04 complex). R237A, R237K and R327Q muta- 
tions increased the for 2', 3' cAMP by 3- to 6-fold, 
thereby underscoring the importance of the phosphate 
contacts for substrate binding affinity. Yet, Arg237 was 
found to make little contribution to the maximal rate of 
2',3'-cyclic phosphate hydrolysis, insofar as the alanine 
change had no effect on k^ax. and the R237K and R237Q 



changes actually increased k^._^^ by 3-fold and 2-fold, re- 
spectively (16). The same trends were seen for the effects 
of R237A, R237K and R237Q on Mn^+-dependent hy- 
drolysis of the non-nucleotide phosphodiesterase substrate 
bis-/7-nitrophenyl phosphate, i.e. these changes increased 
the K.^ for substrate by 2- to 3-fold while increasing /Teat by 
2-fold (R237K), 10-fold (R237A) or 20-fold (R237Q) (15). 
We suspect that weakening the arginine»P04 interactions 
increases the product off-rate, thereby explaining the 
observed increases in ^cat for the mutant enzymes. The 
most impressive effect of the arginine mutations was to 
abolish the Mn^^-dependent phosphomonoesterase 
activity of Cf/iPnkp, thereby altering the reaction outcome 
by limiting it to a single step of phosphodiester hydrolysis 
instead of the sequential phosphodiesterase and mono- 
esterase reactions performed by the wild-type Cz/iPnkp 
(15). It is conceivable that the greater negative charge on 
the associative transition state of the phosphomonoester 
versus the diester demands neutralization by the arginine. 

Mutating the Mn^^ hgand His 189 to aspartate also 
has dramatic effects on Cf/zPnkp activity. The H189D 
change abohshes Ni^^-dependent phosphomonoesterase 
and diesterase activities and the Mn^^-dependent 
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Figure 5. Active site density and atomic contacts. (A) Stereo view of 
the Mn~^»citrate-bound active site of C/APnkp phosphatase showing 
the composite omit electron density in gray mesh (contoured at 
1.0 a, 0.25 -A grid, refinement dataset, 2.0 -A limiting resolution) and 
the anomalous difference Fourier density over the Mn^^ atom in red 
mesh (contoured at 10 a, 0.5 -A grid, phasing dataset, 3.0 -A hmiting 
resolution, model phases). The amino acids are shown as stick models 
with beige carbons. The citrate is a stick model with gray carbons. 
Mn^^ is a magenta sphere; water is a red sphere. The gray sphere 
indicates the position of the second Mn"^ site that is encroached by 
citrate, imputed by alignment to the structure of the di-Mn^^ complex 
of ^-Pase (see below). (B and C) Stereo views of the aligned active sites 
of the C^/;Pnkp phosphatase B protomer (panel B) and the A-Pase C 
protomer (panel C). Amino acids and citrate/sulfate are shown as stick 
models. The citrate carbon atom in panel B that occupies the same 
position as the sulfur atom in panel C is colored yellow. Mn^^ ions 
are magenta spheres. Waters in the metal coordination complex are red 
spheres. Atomic contacts are indicated by dashed lines. 



monoesterase activity, but spares the Mn "^-dependent 
phosphodiesterase functions of C//!Pnkp (15). H189D 
has no effect on the for 2', 3' cAMP, but increases 
the ^cat of the cyclic phosphodiesterase reaction by 
4-fold (16). The H189D mutation fundamentally repro- 
grams the tRNA repair pathway of C/AFnkp, causing it 



to convert a broken RNA 2',3'-cyclic phosphate end to an 
RNA 3'-OH,2'-P04 product instead of the RNA 3'-OH,2'- 
OH product generated by the wild-type enzyme (17). 
Altering the outcome of end healing can redirect the sub- 
sequent end sealing steps of RNA repair toward ligases 
with different end specificities (17). 

Concluding remarks 

The present study provides new insights to the structure 
and inechanism of the phosphatase domain of bacterial 
Pnkp by capturing the enzyme in a complex with Mn^"'" 
and citrate (a inimetic of the scissile phosphate). The core 
fold and the amino acid constituents of the active site are 
similar to those of 1-Pase, the prototypal binuclear 
metallophosphoesterase. The fold is einbellished by dis- 
tinctive Cth-specific loop and P-hairpin modules, with 
the Cth-loop being a good candidate to aid in RNA sub- 
strate recognition by virtue of its proximity to the active 
site. A continuous tract of positive surface potential 
flanking the C//iPnkp active site suggests a role for this 
surface in RNA binding. It is noteworthy that A-Pase lacks 
an analogous positive surface flanking its active site 
(not shown), which is in keeping with the idea that the 
physiological substrate for X-Pase (although unknown) is 
probably not RNA. 

The crystallized phosphatase domain of bacterial Pnkp 
is a monomer. In contrast, the crystallized kinase domain 
is a homodimer. The kinase-kinase homodimer interface 
can now be presumed to be the principal driver of the 
homodimeric quaternary structure of the isolated full- 
length C^APnkp protein (2) and of Pnkp in the context 
of the Pnkp-Henl RNA repair complex, which is a 
(Pnkp)2(Henl)2 tetramer (5). The Henl methyltransferase 
domain is a monomer (6, 7), and the hgase-Henl(N) 
complex is a 1:1 heterodimer (4). The parallel orientation 
of the kinase domain protomers in the kinase homodimer 
in the crystal structure, combined with aforementioned 
knowledge of the structures of all other component 
modules, supports a model for the (Pnkp)2(Henl)2 
tetramer (11) in which the central phosphatase domains 
reside on the same face of the kinase dimer and serve as 
Hnkers to pairs of LIG-Henl heterodimers. The next chal- 
lenge will be to crystallize the complete Pnkp-Henl 
complex, the structure of which can be solved based on 
the now full suite of high-resolution crystal structures of 
the component domains. 

To our knowledge, the Cz/jPnkp phosphatase is the first 
example of a structurally defined binuclear metallopho- 
sphoesterase dedicated to RNA repair or modification. 
Other RNA end-healing enzyines belong to unrelated 
enzyme superfaniilies (discussed in 28). To our knowledge, 
the only other binuclear metallophosphoesterase 
dedicated to RNA modification is the intron-debranching 
enzyme Dbrl that hydrolyzes the 2'-5' phosphodiester 
linkage of the excised lariat intron product of eukaryal 
pre-inRNA splicing. Dbrl, hke Q/jPnkp, is a Mn^^-de- 
pendent phosphodiesterase, and mutational analysis of 
Dbrl highlights the essentiahty of many of the conserved 
side chains that comprise the active sites of Cf/jPnkp 
and A.-Pase (22). 
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